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Abstract 

We consider the problem of determining whether a lacunary (also called a sparse or super-sparse) 
polynomial / is a perfect power, that is, / = h r for some other polynomial h and r £ N, and 
of finding h and r should they exist. We show how to determine if / is a perfect power in time 
polynomial in the number of non-zero terms of /, and in terms of log deg /, i.e., polynomial in the 
size of the lacunary representation. The algorithm works over ¥ q [x] (for large characteristic) and 
over Z[x], where the cost is also polynomial in log ||/||oo- We also give a Monte Carlo algorithm 
to find h if it exists, for which our proposed algorithm requires polynomial time in the output 
size, i.e., the sparsity and height of h. Conjectures of Erdos and Schinzel, and recent work of 
Zannier, suggest that h must be sparse. Subject to a slightly stronger conjectures we give an 
extremely efficient algorithm to find h via a form of sparse Newton iteration. We demonstrate 
the efficiency of these algorithms with an implementation using the C++ library NTL. 



1. Introduction 

In this paper we consider the problem of determining whether a polynomial / equals 
h r for some other polynomial h and positive integer r, and if so, finding h and r. The 
novel aspect of this current work is that our algorithms are efficient for the lacunary (also 
called sparse or supersparse) representation of polynomials. Specifically, we assume 

f = Yl Cj^ 5< G F[xi,...,^], (1.1) 

i<i<t 

where F is a field, cq, ■ ■ ■ , ct € F\{0}, and ei, . . . ,et € are distinct exponent tuples with 
< 1 1 <3 1 1 1 x < ••■ < Hetllj = deg/, where x?* is the monomial x^x^ 2 ■ ■ -x^ u of degree 
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II "Mil = 12i<j<£ e ij- We say / is t-sparse and write t(/) = t. We present algorithms 
which require time polynomial in r(/) and logdeg/. 

Computational work on lacunary polynomials has proceeded steadily for the past 
three decades. From the dramatic initial intractability results of Plaisted (1977, 1984), 
through progress in algorithms (e.g., Bcn-Or and Tiwari (1988), Shparlinski (2000), and 
Kaltofen and Lee (2003)) and complexity (e.g., Karpinski and Shparlinski (1999), Quick 
(1986), and von zur Gathen et al. (1993)), to recent breakthroughs in root finding and 
factorization (Cucker et al., 1999; Kaltofen and Koiran, 2006; Lenstra, 1999), these works 
have important theoretical and practical consequences. The lacunary representation is ar- 
guably more intuitive than the standard dense representation, and in fact corresponds to 
the default linked-list representation of polynomials in modern computer algebra systems 
such as Maple and Mathematica. 

We will always assume that t(/) > 2; otherwise f = x n , and determining whether 
/ is a perfect power is equivalent to determining whether n € N is composite, and to 
factoring n if we wish to produce r dividing n such that / = {x n / r ) r . Surprisingly, the 
intractability of the latter problem is avoided when t(/) > 2. 

We first consider detecting perfect powers and computing the power r for the univariate 
case 



where < ei < ei < ■ ■ ■ < et — deg /. 

Two cases for the field F are handled: the integers and finite fields of characteristic p 
greater than the degree of /. When / S Z[x], our algorithms also require time polynomial 
in logll/H^, where WfW^ = maxi<.j< t \a\ (for / g Q[x], we simply work with / = cf £ 
Z[x], for the smallest c <E Z\{0}). This reflects the bit-length of coefficients encountered in 
the computations. Efficient techniques will also be presented for reducing the multivariate 
case to the univariate one, and for computing a root h such that / = h r . 

1.1. Related work and methods 

Two well-known techniques can be applied to the problem of testing for perfect pow- 
ers, and both are very efficient when / = h'~ is dense. We can compute the squarcfrcc 
decomposition of / as in (Yun, 1976), and determine whether / is a perfect power by 
checking whether the greatest (integer) common divisor of the exponents of all non- 
trivial factors in the squarefree decomposition is at least 2. An even faster method (in 
theory and practice) to find h given / = h r is by a Newton iteration. This technique 
has also proven to be efficient in computing perfect roots of (dense) multi-precision in- 
tegers (Bach and Sorenson, 1993; Bernstein, 1998). In summary however, we note that 
both these methods require approximately linear time in the degree of /, which may be 
exponential in the lacunary size. 

Newton iteration has also been applied to finding perfect polynomial roots of lacunary 
(or other) polynomials given by straight-line programs. Kaltofen (1987) shows how to 
compute a straight-line program for h, given a straight-line program for / = h r and 
the value of r. This method has complexity polynomial in the size of the straight-line 
program for /, and in the degree of h, and in particular is effective for large r. We do not 
address the powerful generality of straight-line programs, but do avoid the dependence 
on the degree of h. 




(1.2) 



Ki<t 
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Closest to this current work, Shparlinski (2000) shows how to recognize whether / = h 2 
for a lacunary polynomial / 6 F q [a;]. Shparlinski uses random evaluations and tests for 
quadratic residues. How to determine whether a lacunary polynomial is any perfect power 
is posed as an open question. 

1.2. Our contributions 

Given a lacunary polynomial / <= Z[x] with r(f) > 2 and degree n, we first present 
an algorithm to compute an integer r > 1 such that / = h r for some h G Z[x], or 
determine that no such r exists. The algorithm requires 0~(t\og 2 \\f\\oc log 2 n) machine 
operations* , and is probabilistic of the Monte Carlo type. That is, for any input, on 
any execution the probability of producing an incorrect answer is strictly less than 1/2, 
assuming the ability to generate random bits at unit cost. This possibility of error can 
be made arbitrarily small with repeated executions. 

A similar algorithm is presented to answer Shparlinski's open question on perfect pow- 
ers of lacunary polynomials over finite fields, at least for the case of large characteristic. 
That is, when the characteristic p of a finite field F is greater than deg/, we provide 
a Monte Carlo algorithm that determines if there exists an h € F[x] and r such that 
/ = h r , and finds r if it exists, which requires 0~(t log 2 n) operations in F. 

An implementation of our algorithm over Z in NTL indicates excellent performance 
on sparse inputs when compared to a fast implementation based on previous technology 
(a variable-precision Newton iteration to find a power-series rth root of /, followed by a 
Monte Carlo correctness check). 

Actually computing h such that / = h r is a somewhat trickier problem, at least insofar 
as bounds on the sparsity of h have not been completely resolved. Conjectures of Schinzcl 
(1987) and recent work of Zannier (2007) suggest that, provided the characteristic of F is 
zero or sufficiently large, h is lacunary as well. To avoid this lack of sufficient theoretical 
understanding, we develop an algorithm which requires time polynomial in both the 
representation size of the input / (i.e., t(/), logn and log ||/||oo) and the representation 
size of the output (i.e., r(h) and log ||/||oo)- This algorithm works by projecting / into a 
sequence of small cyclotomic fields. Images of the desired h in these fields are discovered 
by factorization over an algebraic extension. Finally, a form of interpolation of the sparse 
exponents is used to recover the global h. The algorithm is probabilistic of the Monte 
Carlo type. While this algorithm is polynomial time, we do not claim it will be efficient 
in practice. Instead, we also present and analyze a simpler alternative based on a kind 
of Newton iteration. Subject to what we believe is a reasonable conjecture, this is shown 
to be very fast. 

The remainder of the paper is arranged as follows. In Section 2 we present the main 
theoretical tool for our algorithm to determine if / = h r , and to find r. We also show 
how to reduce the multivariate problem to the univariate one. In Section 3 we show how 
to compute h such that / = h r (given that such h and r exist). Finally, in Section 4, 
we present an experimental implementation of some of our algorithms in the C++ library 
NTL. 

An earlier version of some of this work was presented in the ISSAC 2008 conference 
(Giesbrecht and Roche, 2008). 



* We employ soft-Oh notation: for functions a and ip we say a £ 0~(<p) if u £ O (ip log c ip) for some 
constant e > 0. 
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2. Testing for perfect powers 

In this section we describe a method to determine if a lacunary polynomial f £ F[x] 
is a perfect power. That is, do there exist h £ F[x] and r > 1 such that f = h r ? The 
polynomial h need not be lacunary, though some conjectures suggest it may well have to 
be. We will find r, but not h. 

We first describe algorithms to test if an / € F[x] is an rth power of some polynomial 
h £ F[x], where / and r are both given and r is assumed to be prime. We present and 
analyze variants that work over finite fields ¥ q and over Z. In fact, these algorithms for 
given r are for black-box polynomials: they only need to evaluate / at a small number 
of points. That this evaluation can be done quickly is a property of lacunary and other 
classes of polynomials. 

For lacunary / we then show that, in fact, if h exists at all then r must be small unless 
f = x n . And if / is a perfect power, then there certainly exists a prime r such that / is 
an rth power. So in fact the restrictions that r is small and prime are sufficient to cover 
all nontrivial cases, and our method is complete. 

2.1. Detecting given rth powers 

Our main tool in this work is the following theorem which says that, with reasonable 
probability, a polynomial is an rth power if and only if the modular image of an evaluation 
in a specially constructed finite field is an rth power. 

Theorem 2.1. Let g £ Z be a prime power and r £ N a prime dividing g — 1. Suppose 
that / £ ¥ g [x] has degree n < 1 + ^/g/2 and is not a perfect rth power in F e [a;]. Then 

R { P = # {c £ ¥ e : f(c) £ ¥ e is an rth power} < ^. 
j 4 

Proof. The rth powers in ¥ g form a subgroup H of F* of index r and size (g — l)/r in F* . 

Also, a £ F* is an rth power if and only if a^ e_1 ^ / ' r = 1. We use the method of "completing 
the sum" from the theory of character sums. We refer to Lidl and Niederreiter (1983), 
Chapter 5, for an excellent discussion of character sums. By a multiplicative character 
we mean a homomorphism x '■ F* — > C which necessarily maps ¥ e onto the unit circle. As 
usual we extend our multiplicative characters x so that x(0) = 0, and define the trivial 
character x (a) to be when a = and 1 otherwise. 
For any a £ ¥*, 

1 v--* f 1 if a £ H, 

where x ranges over all the multiplicative characters of order r on F* — that is, all 
characters that are isomorphic to the trivial character on the subgroup H. Thus 



<> 



aGF* \ X r =Xo / X r =Xo aSF* 



x r =x 



aGF„ 
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Here we use the obvious fact that 



£x (/(a))< ^X (/(«))=e-rf<ft 



aew- 



ae¥„ 



where d is the number of distinct roots of / in ¥ e . We next employ the powerful theo- 
rem of Weil (1948) on character sums with polynomial arguments (see Theorem 5.41 of 
Lidl and Niederreiter (1983)), which shows that if / is not a perfect rth power of another 
polynomial, and \ has order r > 1, then 



E x(/(<0) 



aeF„ 



<(n~l)p 1/2 <f, 



using the fact that we insisted n < Summing over the r— 1 non-trivial characters 

of order r, we deduce that 



R (r) < Q r-1 g_ < 
f ~ r r 2 ~ 4 



□ 



Certifying specified powers over ¥ q [x] 

Theorem 2.1 allows us to detect when a polynomial / G F e [x] is a perfect rth power, 
for known r dividing g — 1: choose random a G F e and evaluate £ = f(ap e ~ 1 >' r G F e . 
Recall that £ = 1 if and only if /(a) is an rth power. 

• If / is an rth power, then clearly /(a) is an rth power and we always have £ = 1. 

• If / is not an rth power, Theorem 2.1 demonstrates that for at least 1/4 of the 
elements of ¥ g , f(a) is not an rth power. Thus, for a chosen randomly from ¥ g we 
would expect £ ^ 1 with probability at least 1/4. 

For a polynomial / G F g [x] over an arbitrary finite field ¥ q , where q is a prime power 
such that q — 1 is not divisible by r, we proceed by constructing an extension field F g r— i 
over Fq. From Fermat's Little Theorem and the fact that r\q, we know r\ (g r_1 — 1), 
and we can proceed as above. We now present and analyze this more formally. 

Algorithm IsPerf ectRthPowerGF 

Input: A prime power q, f G ¥ q [x] of degree n < 1 + yfq/2, r G N a prime dividing n, 
and e G M>o 

Output: True if / is the rth power of a polynomial in F e [x]; False otherwise. 
1: Find an irreducible T G ¥ q [z] of degree r — 1, successful with probability at least e/2 
2: Q <— q r ~ 1 

3: Define ¥ e =¥ q [z]/(T) 
4: m<- 2.5(1 +(Tog a (l/e)l) 
5: for j from 1 torn do 
6: Choose random a G F e 
7: £ «- /(a)^- 1 )/'' G F e 
8: if £ 7^ 1 then 
9: return False 
10: return True 



Notes on IsPerf ectRthPowerGF. 
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To accomplish Step 1, a number of fast probabilistic methods are available to find irre- 
ducible polynomials. We employ the algorithm of Shoup (1994). This algorithm requires 
0((r 2 logr + rlogq) log r log log r) operations in ¥ q . It is probabilistic of the Las Vegas 
type, and we assume that it always stops within the number of operations specified, and 
returns the correct answer with probability at least 1/2 and "Fail" otherwise (it never 
returns an incorrect answer). The algorithm is actually presented in Shoup (1994) as 
always finding an irreducible polynomial, but requiring expected time as above; by not 
iterating indefinitely our restatement allows for a Monte Carlo analysis in what follows. 
To obtain an irreducible T with failure probability at most e/2 we run (our modified) 
Shoup's algorithm 1 + |~log 2 (l/e)] times. 

The restriction that n < 1 + \[2 (or alternatively q > 4(rt — l) 2 ) is not problematic. If 
this condition is not met, simply extend ¥ q with an extension of degree v = \\og q (A(n — 
l) 2 )] and perform the algorithm over ¥ q u. At worst, each operation in requires 
0(M(logn)) operations in ¥ q . 

Here we define M(r) as a number of operations in F to multiply two polynomials 
of degree < r over F, for any field F, or the number of bit operations to multiply two 
integers with at most r bits. Using classical arithmetic M(r) is 0(r 2 ), while using the 
fast algorithm of Cantor and Kaltofen (1991) we may assume M(r) is 0(r logr log logr). 

Theorem 2.2. Let q be a prime power, / £ Fji], r S N a prime dividing deg/ and 
e > 0. If / is a perfect rth power the algorithm IsPerf ectRthPowerGF always reports 
this. If / is not a perfect rth power then, on any invocation, this is reported correctly 
with probability at least 1 — e. 

Proof. It is clear from the above discussion that the algorithm always works when / 
is perfect power. When / is not a perfect power, each iteration of the loop will obtain 
£ 7^ 1 (and hence a correct output) with probability at least 1/4. By iterating the loop 
m times we ensure that the probability of failure is at most e/2. Adding this to the 
probability that Shoup's algorithm (for Step 1) fails yields a total probability of failure 
of at most e. □ 

Theorem 2.3. On inputs as specified, the algorithm IsPerf ectRthPowerGF requires 
0((rM(r) logr log q) ■ log(l/e)) operations in F g plus the cost to evaluate a i— ► f(a) at 
0(log(l/e)) points a € ¥ qT -i. 

Proof. As noted above, Shoup's 1994 algorithm requires 0((r 2 log r+r log q) logr log logr) 
field operations per iteration, which is within the time specified. The main cost of the 
loop in Steps 4-8 is computing /(a)^ 1 )/ 1 ", which requires O(logp) or 0(rlogq) oper- 
ations in ¥ e using repeated squaring, plus one evaluation of / at a point in F e . Each 
operation in ¥ e requires 0(M(r)) operations in F g , and we repeat the loop 0(log(l/e)) 
times. □ 

Corollary 2.4. Given / € ¥ q [x] of degree n with r(/) = t, and r £ N a prime dividing 
n, we can determine if / is an rth power with 

O ((rM(r) log r log q + iM(r) log n) ■ log(l/e)) 

operations in ¥ q . When / is an rth power, the output is always correct, while if / is not 
an rth power, the output is correct with probability at least 1 — e. 



(» 



2.3. Certifying specified powers over 1\x] 



For an integer polynomial / € Z[x], we proceed by working in the homomorphic image 
of Z in ¥ p (and then in an extension of that field). We must ensure that the homomor- 
phism preserves the perfect power property we are interested in with high probability. For 
any polynomial g £ F[x], let disc(p) = res(/, /') be the discriminant of g (the resultant of 
/ and its first derivative). It is well known that g is squarefree if and only if disc(g) =/= 0. 
Also define lcocff(/) as the leading coefficient of /, the coefficient of the highest power 
of x in /. 

Lemma 2.5. Let / G Z[x] and / = //gcd(/, /') its squarefree part. Let p be a prime 
such that p\d\sc{f) and pflcocff(/). Then / is a perfect power in Z[x] if and only if 
/ mod p is a perfect power in F p [x] . 

Proof. Clearly if / is a perfect power, then / modp is a perfect power in Z[x]. To 
show the converse, assume that / = /-f 1 • ■ ■ /^™ for distinct irreducible fi,...,f m G Z[x], 
so / = fx ■ ■ ■ f m . Clearly / = fl 1 ■ ■ ■ f^™ mod p as well, and because p\ lcoeff(/) we 
know deg(/j modp) = deg/, for 1 < i < m. Since p{disc(/), / mod p is squarefree 
(see von zur Gathen and Gerhard (2003), Lemma 14.1), and each of the fi mod p must 
be pairwise relatively prime and squarefree for 1 < i < m. Now suppose / mod p is a 
perfect rth power modulo p. Then we must have r | Sj for 1 < i < m. But this immediately 
implies that / is a perfect power in Z[x] as well. □ 

Given any polynomial g = go + g\x + • • • + g m x m € Z[x], we define the height or 
coefficient oo-norm of g as ||<?||oo = max^ \gi\. Similarly, we define the coefficient 1-norm 

1/2 ~ 

of g as Hs-lli = J2> \9i\, and 2-norm as ||5|| 2 = (£\ l^l 2 ) . Since / divides /, we can 
employ the factor bound of Mignotte (1974) to obtain 

ll/lloc < 21/|| 2 < 2V^- ll/IU 

Since disc(/) = res(/, /') is the determinant of matrix of size at most (2n— 1) x (2n — 1), 
Hadamard's inequality implies 

|disc(/)| < (2"n 1 / 2 ||/|| 00 ) (2"t 1 3 / 2 ||/|| 0O ) < 2 2 ™ n 2n ■ \\f\\^. 

Also observe that |lcoeff(/)| < ||/||oo- Thus, the product disc(J) • lcocff(/) has at most 

/Llog 2 (4(n-l) 2 )j" 

prime factors greater than 4(n— l) 2 (we require the lower bound 4(n— l) 2 to employ The- 
orem 2.1 without resorting to field extensions). Choose a 7 > 4(n— l) 2 such that the num- 
ber of primes 7r(27) — ^(7) between 7 and 27 is at least 4// + 1. By Rosser and Schoenfeld 
(1962), 7r(2 7 ) - tt(7) > 27/(5^17) for 7 > 59. Thus if 7 > max{14/iln(14/i), 100}, then 
a random prime not equal to r in the range 7 ... 27 divides lcoeff (/) • disc(/) with prob- 
ability at most 1/4. Primes p of this size have only log 2 p E 0(logn + loglog ||/||oo) 
bits. 



log 2 2 



)2n„2n| 



/I 



2n+l 



7 



Algorithm IsPerf ectRthPowerZ 



Input: / £ X[x] of degree n; r £ N a prime dividing n; e £ K>o; 

Output: True if / is the rth power of a polynomial in Z[x]; False otherwise 



/' 



log; 



(2 2 "V™||/|| 



2n+l 



1) 2 )J 



/Llog 2 (4(n- 

7 <- max{14/iln(14/i),4(n- 1) 2 ,100} 
for i from 1 to . . . [log 2 (l/e)] do 

p <— random prime in the range 7 ... 27 
if NOT IsPerf ectRthPowerGF(p, / mod p, r, 1/4 ) then 
return False 
return True 



Theorem 2.6. Let / £ Z[ir] of degree n, r £ N dividing n and e <G R>o- If / is a perfect 
rth power, the algorithm IsPerf ectRthPowerZ always reports this. If / is not a perfect 
rth power, on any invocation of the algorithm, this is reported correctly with probability 
at least 1 — e. 



Proof. If / is an rth power then so is / mod p for any prime p, and so is any f(a) £ F p . 
Thus, the algorithm always reports that / is an rth power. Now suppose / is not an 
rth power. If p | disc(/) it may happen that / mod p is an rth power. This happens with 
probability at most 1/4 and we will assume that the worst happens in this case. When 
pjdisc(/), the probability that IsPerf ectRthPowerGF incorrectly reports that / is an 
rth power is also at most 1/4, by our choice of parameter e. Thus, on any iteration of 
steps 4-6, the probability of finding that / is an rth power is at most 1/2. The probability 
of this happening [~log 2 (l/e)] times is clearly at most e. □ 



Theorem 2.7. On inputs as specified, the algorithm IsPerf ectRthPowerZ requires 

0(rM(r) logr-MQogn + loglogH/IU) • (log 71 + log log H/H^) ■ Iog(l/e)), 

or 0~(r 2 (log n+log log ||/|| 00 ) 2 -log(l/e)) bit operations, plus the cost to evaluate (a,p) 1— > 
f(a) mod p at 0(log(l/e)) points a £ ¥ p for primes p with logp £ 0(\og n+log log ||/||oo)- 



Proof. The number of operations required by each iteration is dominated by Step 5, for 
which 0(rM(r) log r logp) operations in ¥ p is sufficient by Theorem 2.3. Since logp £ 
0(logn + log log ll/Hoo) we obtain the final complexity as stated. □ 



We obtain the following corollary for t-sparsc polynomials in Z[x]. This follows since 
the cost of evaluating a ^-sparse polynomial f £ Z[x] modulo a prime p is 0(t log ||/||oo logp+ 
t lognM(logp)) bit operations. 

Corollary 2.8. Given / £ Z[x] of degree n, with r(/) = t, and r £ N a prime dividing 
n, we can determine if / is an rth power with 

0~ ((r 2 log 2 n + t log 2 n + rlog H/IU logn) ■ Iog(l/e)) 

bit operations. When / is an rth power, the output is always correct, while if / is not 
an rth power, the output is correct with probability at least 1 — e. 
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2.4- An upper bound on r. 



In this subsection we show that if / = h r and / ^ x n then r must be small. Over 
T\x\ we show that \\h\\ 2 is small as well. A sufficiently strong result over many fields is 
demonstrated by Schinzel (1987), Theorem 1, where it is shown that if / has sparsity 
t > 2 then t > r + 1 (in fact a stronger result is shown involving the sparsity of h as 
well). This holds when either the characteristic of the ground field of / is zero or greater 
than deg /. 

Here we give a (much) simpler result for polynomials in Z[x], which bounds \\h\\ 2 and 
is stronger at least in its dependency on t though it also depends upon the coefficients 
of/. 

Theorem 2.9. Suppose / £ Z[x] with deg / = n and r(/) = t, and / = h r for some 
h £ 1[x] of degree s and r > 2. Then \\h\\ 2 < \\f\\\ /r . 

Proof. Let p > n be prime and £ £ C a pth primitive root of unity. Then 

\\h\\l= E i^i 2 = ^ E imoi 2 - 

0<i<s " 0<i<p 

(this follows from the fact that the Discrete Fourier Transform (DFT) matrix is orthogo- 
nal). In other words, the average value of |/i(0)| 2 for i = . . .p — 1 is \\h\\ 2 , and so there 
exists a k £ {0, . . .,p - 1} with \h(( k )\ 2 > \\h\\ 2 2 . Let 6 = ( k . Then clearly \h(6)\ > \\h\\ 2 . 
We also note that f{6) = h(6) r and \f(6)\ < Wf^, since \6\ = 1. Thus, 

IWI 2 < IM<?)I = \f(o)\ 1/r < \\f\\\ /r - n 

The following corollary is particularly useful. 

Corollary 2.10. If / £ Z[x] is not of the form x n , and / = h r for some h £ Z[x], then 

(i) r<21og 2 ||/|| 1 . 

(ii) r(h) < \\f\\l /r 

Proof. Part (i) follows since \\h\\ 2 > \/2. Part (ii) follows because \\h\\ 2 > yjr(h). □ 

These bounds relate to the sparsity of / since \\f\\ 1 < t(/)||/|| 00 . 

2.5. Perfect Power Detection Algorithm 

We can now complete the perfect power detection algorithm, when we are given only 
the i-sparse polynomial / (and not r). 
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Algorithm IsPerf ectPowerZ 

Input: / £ Z[x] of degree n and sparsity t > 2, e £ K >0 
Output: True and r if / = ft. r for some h £ Z[x] 
False otherwise. 

1: V <— {primes r \ n and r < 2 log 2 (t||/||oo)} 

2: for r £ V do 

3: if IsPerf ectRthPowerZ(/, r, e/#V) then 
4: return True and r 
5: return False 



Theorem 2.11. If / £ Z[x] = h r for some h £ Z[x], the algorithm IsPerf ectPowerZ 
always returns "True" and returns r correctly with probability at least 1 — e. Otherwise, 
it returns "False" with probability at least 1 — e. The algorithm requires 0~(t log 2 ||/||oo • 
log 2 (n) • log(l/e)) bit operations. 

Proof. From the preceding discussions, we can see that if / is a perfect power, then it 
must be a perfect rth power for some r £ V . So the algorithm must return true on some 
iteration of the loop. However, it may incorrectly return true too early for an r such that 
/ is not actually an rth power; the probability of this occurring is the probability of error 
when / is not a perfect power, and is less than e/^V at each iteration. So the probability 
of error on any iteration is at most e, which is what we wanted. 

The complexity result follows from the fact that each r £ 0(\ogt + log||/|| ) and 
using Corollary 2.8. □ 

For polynomials in ¥ q [x] we use Schinzel's bound that r < t — 1 and obtain the 
following algorithm. 

Algorithm IsPerf ectPowerGF 

Input: / <G ¥ q [x] of degree n and sparsity t, where the characteristic of F g is greater 

than n, and e £ K>o 
Output: True and r if / = h r for some h £ ¥ q [x]; 
False otherwise. 
1: V <— {primes r \ n and r < t} 
2: for p £ V do 

3: if IsPerf ectRthPowerGF( /, r, e/#V ) then 
4: return True and r; 



Theorem 2.12. If / = h r for h £ ¥ q [x], the algorithm IsPerf ectPowerGF always returns 
"True" and returns r correctly with probability at least 1— e. Otherwise, it returns "False" 
with probability at least 1 — e. The algorithm requires 0~(t 3 (log q + logn)) operations in 
F,. 

Proof. The proof is equivalent to that of Theorem 2.11, using the complexity bounds 
in Corollary 2.4. □ 
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2. 6. Detecting multivariate perfect powers 

In this subsection we examine the problem of detecting multivariate perfect powers. 
That is, given a lacunary / G F[xi,...,x^] of total degree n as in (1.1), we want to 
determine if / = h r for some h G F[xi, . . - ,X(\ and r G N. This is done simply as a 
reduction to the univariate case. 

First, given / G F[xi, . . . , xe\, define the squarefree part / G F[xi, . . . , xg\ as the 
squarefree polynomial of highest total degree which divides /. 

Lemma 2.13. Let / G F[xi, ...,xt] be of total degree n > and let / G F[xi, . . . , xg\ 
be the squarefree part of /. Define 

A = &&cx{f{yix, . . .,yex)) = reSx(f(yix, . . .,y e x) > f'(y 1 x, . . .,ytx)) G F[yi, ...,yi\ 

and 

A = lcoeff x (/(yix, . . . , y t x)) G F[y x , ...,yi\ 

for independent indeterminates x, yi, . . . , yt. Assume that oi, . . . , at € F with A(ai, . . . , at) 
and A(ai, . . . , a n ) ^ 0. Then f(xi, . . . , ) is a perfect power if and only if f[a\X, . . . , a^x) 
€ F[x] is a perfect power. 

Proof. Clearly if / is a perfect power, then /(aix, . . . , aex) is a perfect power. To prove 
the converse, assume that 

for irreducible fx, . . . , / m G F[xi, . . . , X(\. Then 

f(yix, y m x) = fi(yix, y m x) Sl ■ ■ ■ f m (yix, y m x) Sm 

and each of the fi(y\x, . . . , y m x) are irreducible. Now, since A(ai, . . . , a rn ) ^ 0, we know 
the deg(/(aix, . . . , aix)) = deg/ (the total degree of /). Thus, deg/i(aix, . . . , agx) = 
deg fi for 1 < i < t as well. Also, by our assumption, disc(/(aix, . . . , a^x)) ^ 0, so all of 
the fi{a\x, . . . , aex) are squarefree and pairwise relatively prime for 1 < i < k, and 

f(aix, . . .,a,£x) = /i(aix, . . . ,a £ x) Sl ■ ■ ■ f m (aix, . . . ,a e x) Sm . 

Assume now that /(aix, . . . , a^x) is an rth perfect power. Then r divides Si for 1 < i < m. 
This immediately imples that / itself is an rth perfect power. □ 

It is easy to see that the total degree of A is less than 2n 2 and the total degree of A is 
less than n, and that both A and A are non-zero. Thus, for randomly chosen ai, . . . , 
from a set S C F of size at least 8n 2 + 4n we have A(ai, . . . , at) = or A(ai, . . . , ai) = 
with probability less than 1/4, by Zippel (1979) or Schwartz (1980). This can be made 
arbitrarily small by increasing the set size and/or repetition. We then run the appropriate 
univariate algorithm over F[x] (depending upon the field) to identify whether or not / is 
a perfect power, and if so, to find r. 
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3. Computing perfect roots 

Once we have determined that / G F[x] is equal to h r for some h G f[x], the next 
task is to actually compute h. Unfortunately, as noted in the introduction, there are no 
known bounds on t{K) which are polynomial in r(f). 

The question of how sparse the polynomial root of a sparse polynomial must be (or 
equivalently, how dense any power of a dense polynomial must be) relates to some ques- 
tions first raised by Erdos (1949) on the number of terms in the square of a polynomial. 
Schinzcl extended this work to the case of perfect powers and proved that r(h r ) tends to 
infinity as r(h) tends to infinity (Schinzel, 1987). Some conjectures of Schinzel suggest 
that r(h) should be 0(r(/)). A recent breakthrough of Zannier (2007) show that r(h) is 
bounded by a function which does not depend on deg /, but this bound is unfortunately 
not polynomial in t(/). 

However, our own (limited) investigations, along with more extensive ones by 
Coppersmith and Davenport (1991), and later Abbott (2002), suggest that, for any h G 
F[x], where the characteristic of F is not too small, r(h) G 0(T(h r ) + r). We skirt this 
problem here by simply making our algorithms output sensitive; the time required is 
polynomial in the lacunary size of the input and the output. 

3.1. Computing rth roots in polynomial-time (without conditions) 

In this subsection we present an algorithm for computing an h such that / = h r given 
/ G 7L\x\ and r G Z and assuming that such an h exists. The algorithm requires time 
polynomial in t = r(/), log deg/, log||/||oo and a given upper bound fx > m = t{K). 
It is not conditional on any conjectures, but is probabilistic of the Monte Carlo type. 
That is, the computed polynomial h is such that h r = / with high probability. We 
will only demonstrate that this algorithm requires polynomial time. A more detailed 
analysis is performed on the (more efficient) algorithm of the next subsection (though 
that complexity is subject to a modest conjecture). 

The basic idea of the algorithm here is that we can recover all the coefficients in Q 
as well as modular information about the exponents of h from a homomorphism into a 
small cylotomic field over Q. Doing this for a relatively small number of cyclotomic fields 
yields h. 

Assume that (the unknown) h G Z[x] has form 

h = S~] biX di for bi, . . . , b m G Z\{0}, and < d\ < d 2 < • ■ ■ < d m , 

l<i<m 

and that p > 2 is a prime distinct from r such that 

p\ Yl (dj-di), and p\ Y[ (di + 1). 

l<z<j<m l<i<m 

Let ( p £ C be a pth primitive root of unity, and Q p = 1 + z + ■ ■ ■ + z 9 ^ 1 G Z[z] 
its minimal polynomial, the pth cyclotomic polynomial (which is irreducible in Q[z]). 
Computationally we represent Q(Cp) as Q[z]/($ p ), with Q v = z mod <j> p . Observe that 
Cp = (p lcmp for any k G Z, where kremp is the least non-negative residue of k modulo 
p. Thus 

H( P ) = h p {( p ) for h p = J2 b lX d ' rcmp eZ[x], 

1< i < m 
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and hp is the unique representation of fo(Cp) as a polynomial of degree less than p — 1. By 
our choice of p, none of the exponents of h are equivalent modulo p and all the exponents 
reduced modulo p are strictly less thanp— 1 (since our conditions imply di ^ (p— 1) mod p 
for 1 < i < to). This also implies that the coefficients of h p are exactly the same as those 
of h, albeit in a different order. 

Now observe that we can determine h p quite easily from the roots of 

r P (y) = 2/--/(C P )eQ(C P M 

These roots can be found by factoring the polynomial T p (y) in Q(Cp)[y], and the roots 
in C must be uj l h(£ p ) £ C for < i < r, where lu is a primitive rth root of unity. When 
r > 2, and since we chose p distinct from r, the only rth root of unity in Q(Cp) is 1. 
Thus T p (y) has exactly one linear factor, and this must equal to y — ft-(Cp) = y — h p (£ p ), 
precisely determining h p . When r = 2, we have 

r p (tf) = (y~ HC p ))(y + h(Q) = (y- h p (( p ))(y + h p (Q) 

and we can only determine h p (£ p ) (and h p and, for that matter, h) up to a factor of ±1. 
However, the exponents of h p and — h p are the same, and the ambiguity is only in the 
coefficients (which we resolve later). 

Finally, we need to perform the above operations for a sequence of cyclotomic fields 
Q(Cpi): Q(Cp 2 ); ■••) 'QKCpfc) such that the primes in V = {pi, . . . ,pk} allow us to recover 
all the exponents in h. Each prime p £ V gives the set of exponents of h reduced modulo 
that prime, as well as all the coefficients of h in Z. That is, from each of the computations 
with p G V wc obtain 

C = {bi , . . . , b m } and E p = {d\ remp, e?2 remp, . . . , d remp} , 

but with no clear information about the order of these sets. In particular, it is not 
obvious how to correlate the exponents modulo the different primes directly. To do this 
we employ the clever sparse interpolation technique of Garg and Schost (2008) (based on 
a method of Grigoricv and Karpinski (1987) for a different problem), which interpolates 
the symmetric polynomial in the exponents: 

g = (x — di)(x — cfe) • ■ ■ (x — d m ) £ Z[x]. 

For each p € V we compute the symmetric polynomial modulo p, 

g p = (x — (di remp)) (a; — (c?2 remp)) ■ ■ ■ (x — (d m remp)) = g mod p, 

for which we do not need to know the order of the exponent residues. We then determine 
g e Z[x] by the Chinese remainder theorem and factor g over Z\x\ to find the d\ , . . . , d rn £ 
Z. Thus the product of all primes in p £ V must be at least 2II3HOO to recover the 
coefficients of g uniquely. It is easily seen that 2||^|| oo < 2n m . 

As noted above, the computation with each p £ V recovers all the exponents of h in 
Z, so using only one prime p £ V, we determine the jth exponent of h as the coefficient 
of x dj rom P in hp for 1 < j < m. If r = 2 we can choose either of the roots of T p (y) (they 
differ by only a sign) to recover the coefficients of h. 

The above discussion is summarized in the following algorithm. 
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Algorithm ComputeRootAlgebraic 
Input: / G Z[x] as in (1.2) and r, fj. G N. 

Output: ft, G Z[x] such that / = ft r and r(/i) < fi, provided such an h exists. 
1: 7 «— smallest integer such that 27/5 log7 > 10/i 2 (log 2 n)(l + fi log 2 n). 
2: <— set of fc > mlog 2 n primes chosen uniformly at random from {7, . . . , 27}. 
3: for p G V do 

4: Represent Q(Cp) by Q[x]/($ p ), where $ p = 1 + z H h and Cp = z mod <F 

5: Compute f(Q = Ei< 4 < t rcmp e Q(C P ). 

6: ftp <— root of T p = y r — /(Cp) in Q(Cp), found by factoring T p over Q(( p )[y}. 

7: if deg hp > p — 1 or ft p has non-integer coefficients then 

8: return FAIL 

9: Write h p G Z[ a;] as X/i<j<m bi P x dzp . 
10: if ?Ti differs from previous values of m then 
11: return FAIL 

12: g p <— (x - dip) (a: - d 2p ) ■ • • (x— d mp ) G Z p [x]. 

13: Reconstruct g G Z[x] from {g p } p ^-p by the Chinese remainder algorithm. 
14: {di, d,2, ■ ■ ■ , d m } <— distinct integer roots of g G Z[a;]. 

15: Choose any p G "P. For 1 < j < m, let bj G Z be the coefficient of rcmp in ft p . 
16: Return ft = Ei< i<m ■ 



Theorem 3.1. The algorithm ComputeRootAlgebraic works as stated. It is probabilistic 
of the Monte Carlo type and returns the correct answer with probability at least 9/10 on 
any execution. It requires a number of bit operations polynomial in t = t(/), log deg/, 
log 11/11, and fi. 

Proof. In Step 1 we need to choose a set of primes V which are all good with sufficiently 
high probability, in the sense that for all p G V 

p = r- [] (dj-di)- II (* + l)^0 mod p. 

l<z<j<m l<2<m 

2 n 

It is easily derived that j3 < , which has fewer than log 2 f3 < /i log 2 n prime factors. 
We also need to recover g in Step 7, and ||<?|joo < n^, so we need at least 1 + log 2 ||<7|| < 
1 + /ilog 2 n primes. Thus, if V has at least 10/i 2 log 2 (n)(l + /Ulog 2 n), the probability of 
choosing a bad prime from V is at most 1/(10(1 + /xlog 2 n)). The probability of choosing 
a bad prime with (1 + p\og 2 n) choices is at most 1/10, and the probability that all 
the primes are good is at least 9/10. Numbers are chosen uniformly and randomly from 
{7, . . . , 27} and tested for primality, say by Agrawal et al. (2004). Correctness of the re- 
mainder of the algorithm follows from the previous discussion. Factoring the polynomials 
T p G Q(Cp)[j/] can be performed in polynomial time with the algorithm of, for example, 
Landau (1985), and all other steps clearly require polynomial time as well. □ 

3.2. Faster root computation subject to conjecture 

Algorithm ComputeRootAlgebraic is probabilistic of the Monte Carlo type and not 
of the Las Vegas type because we have no way of certifying the output — i.e. that h r = f 
for given lacunary ft, / G Z[x] — in polynomial time. One way to accomplish this would 
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be to simply compute h r by repeated squaring and comparing the result to /, but to do 
so in polynomial time would require bounds on the sparsity of each intermediate power 
T(h l ) for 2 < i < r based on t(K) and t(/). 

In fact, with such sparsity bounds we can actually derive a deterministic algorithm 
based on Newton iteration. This approach does not rely on advanced techniques such as 
factoring over algebraic extension fields, and hence will be much more efficient in practice. 
It is also more general as it applies to fields other than Z and to powers r which are not 
prime. 

Unfortunately, this algorithm is not purely output-sensitive, as it relies on the following 
conjecture regarding the sparsity of powers of h: 

Conjecture 3.2. For r, s £ N, if the characteristic of F is zero or greater than rs, and 
h £ F[x] with deg h = s, then 

r(h l mod x 2s ) < r(h r mod x 2s ) + r, i = 1, 2, . . . , r - 1. 

This corresponds to intuition and experience, as the system is still overly contrained 
with only s degrees of freedom. A weaker conjecture would suffice to prove polynomial 
time, but we use the stated bounds as we believe these give more accurate complexity 
measures. 

Our algorithm is essentially a Newton iteration, with special care taken to preserve 
sparsity. We start with the image of h modulo x, using the fact that /(0) = h(0) r , and 
at Step i = 1, 2, ... , [log 2 (deg h + 1)] , we compute the image of h modulo x l . 

Here, and for the remainder of this section, we will assume that f,h£ f[x] with degrees 
n and s respectively such that / = h r for reNat least 2, and that the characteristic of 
F is either zero or greater than n. As usual, we define t = r(/). We require the following 
simple lemma. 

Lemma 3.3.* Let k, £ € N such that I < k and k + £ < s, and suppose hi £ F[x) is the 
unique polynomial with degree less than k satisfying h\ = f mod x k . Then 

r(/i[ +1 mod x k+e ) < 2t(t + r). 



Proof. Let hi £ F[x] be the unique polynomial of degree less than I satisfying h\ + 
h 2 x k = h mod x k+l . Since h r = /, 

f = h r 1 +rh[- 1 h 2 x k modx k+i . 

Multiplying by hi and rearranging gives 

h r i +1 = hif -rfh 2 x k mod 

Because hi mod x k and h 2 mod x £ each have at most r{h) terms, which by Conjecture 3.2 
is less than t — r, the total number of terms in h[ mod x k+l is less than 2t(t — r). □ 

This essentially tells us that the "error" introduced by examining higher-order terms 
of h\ is not too dense. It leads to the following algorithm for computing h. 



* Lemma subject to the validity of Conjecture 3.2. 



15 



Algorithm ComputeRootNewton 



Input: / G F[x], r G N such that / is a perfect rth power 
Output: h G F[sc] such that / = h r 

l: u <— highest power of x dividing / 

2: /u <— coefficient of x u in / 

3: 5 «- //(/„!») 

4: ft-f- 1, fc-f- 1 

5: while kr < degg do 

6: I <— min{fc, (deg#)/r + 1 — k} 



hg — h r+1 mod x k+ 

a ' k 

rx K 

h <— h + {&/ g mod x )x k 

k^k + i 
b «— any rth root of / u in F 
return bhx w / r 



Theorem 3.4. If / G F[aj] is a perfect rth power, then ComputeRootNewton returns an 
h G F[x] such that h r = f. 

Proof. Let u, f u ,g be as defined in Steps 1-4. Thus / = f u gx u . Now let h be some rth 
root of /, which we assume exists. If we similarly write h = h v gx v , with h v G F and 
g G F[a:] such that g(0) = 1, then h r = hjg r x vr . Therefore /„ must be a perfect rth 
power in F, r\u, and g is a perfect rth power in F[x] of some polynomial with constant 
coefficient equal to 1. 

Denote by hi the value of h at the beginning of the ith iteration of the while loop. 
So hi = 1. We claim that at each iteration through Step 6, hi = g mod x k . From the 
discussion above, this holds for i = 1. Assuming the claim holds for all f = 1,2,... ,j, we 
prove it also holds for i = j + 1. 

From Step 8, hj+i = hj + {a/g mod x l )x k , where a is as defined on the jth iteration 
of Step 7. We observe that 

hjh^ = h r j +1 + rh r 3 (a/g mod x l )x k mod x k+e . 

From our assumption, K = f mod x k , and I < k, so we have 

hjh r j+1 ee h r j +1 + rax k = h^ 1 + hjf - h r + x = hjf mod x k+e 

Therefore hj +i = f mod x k+e , and so by induction the claim holds at each step. Since 
the algorithm terminates when kr > degg, we can see that the final value of h is an rth 
root of g. Finally, (bhx u ^ r ) = f u gx u = /, so the theorem holds. □ 

Theorem 3.5.^ If / G F[x] has degree n and t nonzero terms, then ComputeRootNewton 

uses O ((i + r) 4 log r log nj operations in F and an additional O ((t + r) 4 log r log 2 n) bit 
operations, not counting the cost of root-finding in the base field F on Step 10. 



t Theorem subject to the validity of Conjecture 3.2. 
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Proof. First consider the cost of computing h r+1 in Step 7. This will be accomplished 
by repeatedly squaring and multiplying by h, for a total of at most 2[log 2 (r + 1)J mul- 
tiplications. As well, each intermediate product will have at most t(/) + r < (t + r) 2 
terms, by Conjecture 3.2. The number of field operations required, at each iteration, is 
O ((i + r) A log r) , for a total cost of O ((i + r) 4 log r log n) . 

Furthermore, since k + £ < 2 1 at the i'th step, for 1 < i < log 2 n, the total cost in bit 
operations is less than 

(* + r ) 4 l °S2 ri€0((t + r) 4 log r log 2 n) . 

l<z<log 2 n 

In fact, this is the most costly step. The initialization in Steps 1-4 uses only 0(t) 
operations in F and on integers at most n. And the cost of computing the quotient on 
Step 8 is proportional to the cost of multiplying the quotient and dividend, which is at 
most 0{t(t + r)). □ 

When F = Q, we must account for coefficient growth. We use the normal notion of 
the size of a rational number: For a£Q. write a = a/b for a, b relatively prime integers. 
Then define H(a) = max{|a|, |6|}. And for / £ Q[x] with coefficients Ci, . . . , c t £ Q, write 
Tt(f) = m&xH(ci). 

Thus, the size of the lacunary representation of / £ Q[x] is proportional to t(/), deg /, 
and logTi(f). Now we prove the bit complexity of our algorithm is polynomial in these 
values, when F = Q. 

Theorem 3.6.^ Suppose / £ Q[x] has degree n and t nonzero terms, and is a perfect rth 
power. ComputeRootNewton computes an rth root of / using 0~ (t(t + r) 4 • logn ■ \ogH.(f)) 
bit operations. 

Proof. Let h G Q[x] such that h r = f, and let c £ Z>o be minimal such that ch £ Z[x]. 
Gaufi's Lemma tells us that c r must be the least positive integer such that c r f £ 1\x] as 
well. Then, using Theorem 2.9, we have: 

n(h) < ihu < ||ch|| a < (t\w\f\u^ < t^n(ff + ^ r . 

(The last inequality comes from the fact that the 1cm of the denominators of / is at most 

my.) 

Hence \ogH(h) £ O ((t\ogH(f))/r). Clearly the most costly step in the algorithm 
will still be the computation of h^ +1 at each iteration through Step 7. For simplicity in 
our analysis, we can just treat hi (the value of h at the zth iteration of the while loop 
in our algorithm) as equal to h (the actual root of /), since we know r(hi) < r(h) and 
HQh) < H(h). 

Lemma 3.3 and Conjecture 3.2 tell us that r(/i z ) < 2{t + r) 2 for i = 1, 2, . . . , r. To 
compute h r+ , we will actually compute (ch) r+1 £ Z[x] by repeatedly squaring and 
multiplying by ch, and then divide out c r+1 . This requires at most [log 2 ^ + lj squares 
and products. 

Note that IIWI^ < (t + r) 2 \\(chY\\l and ||( C / l ) 4+1 || 00 < (t + r) 2 || WHJIc/^. 
Therefore 

HWlL^Ct + r^H^, i = l,2,. ..,r, 
and thus log || (c/i)* || a>0 £ O (r(t + r) + t log ?<(/)), for each intermediate power (ch) 1 . 



17 



Thus each of the O ((£ + r) 4 logr) field operations at each iteration costs 0(M(t log H(f) + 
log r(t + r))) bit operations, which then gives the stated result. □ 



The method used for Step 10 depends on the field F. For F = Q, we just need to find 
two integer perfect roots, which can be done in "nearly linear" time by the algorithm of 
Bernstein (1998). Otherwise, we can use any of the well-known fast root-finding methods 
over F[x] to compute a root of x r — f u . 





Fig. 1. Comparison of Newton Iteration (left) vs. our IsPerf ectPowerZ (right). Inputs are dense. 




Fig. 2. Comparison of Newton Iteration (left) vs our IsPerf ectPowerZ (right). Inputs are sparse, 
with sparsity fixed around 500. 



4. Implementation 

To investigate the practicality of our algorithms, we implemented IsPerf ectPowerZ 
using Victor Shoup's NTL. This is a high-performance C++ for fast dense univariate 
polynomial computations over Z[x] or F 9 [a;]. 

NTL does not natively support a lacunary polynomial representation, so we wrote our 
own using vectors of coefficients and of exponents. In fact, since IsPerf ectPowerZ is a 
black-box algorithm, the only sparse polynomial arithmetic we needed to implement was 
for evaluation at a given point. 

The only significant diversion between our implementation and the algorithm specified 
in Section 2 is our choice of the ground field. Rather than working in a degree-(r — 1) 
extension of F p , we simply find a random p in the same range such that (r — 1) | p. It 
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is more difficult to prove that we can find such a p quickly (using e.g. the best known 
bounds on Linnik's Constant), but in practice this approach is very fast because it avoids 
computing in field extensions. 

As a point of comparison, we also implemented the Newton iteration approach to 
computing perfect polynomial roots, which appears to be the fastest known method for 
dense polynomials. This is not too dissimilar from the techniques from the previous sec- 
tion on computing a lacunary rth root, but without paying special attention to sparsity. 
We work modulo a randomly chosen prime p to compute an rth perfect root h, and then 
use random evaluations of h and the original input polynomial / to certify correctness. 
This yields a Monte Carlo algorithm with the same success probability as ours, and so 
provides a suitable and fair comparison. 

We ran two sets of tests comparing these algorithms. The first set, depicted in Figure 
1, does not take advantage of sparsity at all; that is, the polynomials are dense and have 
close to the maximal number of terms. It appears that the worst-case running time of 
our algorithm is actually a bit better than the Newton iteration method on dense input, 
but on the average they perform roughly the same. The lower triangular shape comes 
from the fact that both algorithms can (and often do) terminate early. The visual gap in 
the timings for the sparse algorithm comes from the fact that exactly half of the input 
polynomials were perfect powers. It appears our algorithm terminates more quickly when 
the polynomial is not a perfect power, but usually takes close to the full amount of time 
otherwise. 

The second set of tests, depicted in Figure 2, held the number of terms of the perfect 
power, t(/), roughly fixed, letting the degree n grow linearly. Here we can see that, for 
sufficiently sparse /, our algorithm performs significantly and consistently better than 
the Newton iteration. In fact, we can see that, with some notable but rare exceptions, 
it appears that the running time of our algorithm is largely independent of the degree 
when the number of terms remains fixed. The outliers we see probably come from inputs 
that were unluckily dense (it is not trivial to produce examples of h r with a given fixed 
number of nonzero terms, so the sparsity did vary to some extent). 

Perhaps most surprisingly, although the choices of parameters for these two algorithms 
only guaranteed a probability of success of at least 1/2, in fact over literally millions of 
tests performed with both algorithms and a wide range of input polynomials, not a single 
failure was recorded. This is of course due to the loose bounds employed in our analysis, 
indicating a lack of understanding at some level, but it also hints at the possibility of a 
deterministic algorithm, or at least one which is probabilistic of the Las Vegas type. 

Both implementations are available as C++ code downloadable from the second au- 
thor's website. 
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